tuoma haarnoja
- Asia > China > Beijing > Beijing (0.05)
- North America > United States > New Jersey > Mercer County > Princeton (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
vitchyr/rlkit
Reinforcement learning framework and algorithms implemented in PyTorch. To get started, checkout the example scripts, linked above. The initial release for 0.2 has the following major changes: Overall, the refactors are intended to make the code more modular and readable than the previous versions. These Anaconda environments use MuJoCo 1.5 and gym 0.10.5. You'll need to get your own MuJoCo key if you want to use MuJoCo.
Soft actor critic – Deep reinforcement learning with real-world robots
We are announcing the release of our state-of-the-art off-policy model-free reinforcement learning algorithm, soft actor-critic (SAC). This algorithm has been developed jointly at UC Berkeley and Google Brain, and we have been using it internally for our robotics experiment. Soft actor-critic is, to our knowledge, one of the most efficient model-free algorithms available today, making it especially well-suited for real-world robotic learning. We also release our implementation of SAC, which is particularly designed for real-world robotic systems. What makes an ideal deep RL algorithm for real-world systems?